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In re application of: 

HOGLUND et al. 
U.S. National Phase of PCT/SE00/01767 
Entry papers filed herewith March 15, 2002 
For: DNA CONSTRUCT AND ITS USE 



Attention: PCT OFFICE 



PRELIMINARY AMENDMENT 
AND INFORMATION DISCLOSURE STATEMENT 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

The present application is the U.S. national phase of international application 
number PCT/SEOO/01767. The following amendments pertain to the claims as 
amended. 

Please note that the amended page 1 (new claim set) attached to the 
International Preliminary Examination Report (Annexes) and submitted herewith, have 
replaced the originally filed page 8 of the application. The claims to be examined and 
amended by this preliminary amendment are found on amended page 1 (of the new 
claim set). 

Please amend the above-identified application as follows: 



IN THE SPECIFICATION: 



Please add the attached ABSTRACT OF THE DISCLOSURE to the application. 
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IN THE CLAIMS : 

Please replace claims 3 and 5-6 with the following amended claims. 

3(Amended). Transgenic oilseed plant cell according to claim 1 , wherein the 
promoter is a napin promoter, the peptide with enzyme activity necessary for keto group 
containing xanthophyll production and esterification is selected from the group 
consisting of peptides with, 1-D-deoxyxylulose 5-phosphate synthase, isopentenyl 
pyrophosphate:dimethylallyl pyrophosphate isomerase, geranylgeranyl pyrophosphate 
synthase, phytoene synthase, phytoene desaturase, zeta-carotene desaturase, 
lycopene beta-cyclase, p-carotene hydroxylase, and acyl transferase activity. 

5(Amended). Transgenic oilseed plant cell according to claim 1, wherein the 
oilseed plant is selected from the group consisting of rape, sunflower, soybean and 
mustard. 

6(Amended). Transgenic oilseed plant cell according to claim 1 , wherein the cell 
expresses xanthophyll. 

Please add the following new claims to the application. 

10(New). Transgenic oilseed plant cell according to claim 2, wherein the 
promoter is a napin promoter, the peptide with enzyme activity necessary for keto group 
containing xanthophyll production and esterification is selected from the group 
consisting of peptides with, 1-D-deoxyxylulose 5-phosphate synthase, isopentenyl 
pyrophosphate^ imethylallyl pyrophosphate isomerase, geranylgeranyl pyrophosphate 
synthase, phytoene synthase, phytoene desaturase, zeta-carotene desaturase, 
lycopene beta-cyclase, p-carotene hydroxylase, and acyl transferase activity. 
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11 (New). Transgenic oilseed plant cell according to claim 2, wherein the 
oilseed plant is selected from the group consisting of rape, sunflower, soybean and 
mustard. 

12(New). Transgenic oilseed plant cell according to claim 3, wherein the 
oilseed plant is selected from the group consisting of rape, sunflower, soybean and 
mustard. 

13(New). Transgenic oilseed plant cell according to claim 4, wherein the 
oilseed plant is selected from the group consisting of rape, sunflower, soybean and 
mustard. 

14(New). Transgenic oilseed plant cell according to claim 10, wherein the 
oilseed plant is selected from the group consisting of rape, sunflower, soybean and 
mustard. 

1 5(New). Transgenic oilseed plant cell according to claim 2, wherein the cell 
expresses xanthophylls. 

16(New). Transgenic oilseed plant cell according to claim 3, wherein the cell 
expresses xanthophylls. 

17(New). Transgenic oilseed plant cell according to claim 4, wherein the cell 
expresses xanthophylls. 

1 8(New). Transgenic oilseed plant cell according to claim 5, wherein the cell 
expresses xanthophylls. 

1 9(New). Transgenic oilseed plant cell according to claim 1 0, wherein the cell 
expresses xanthophylls. 
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20(New). Transgenic oilseed plant cell according to claim 1 4, wherein the cell 
expresses xanthophylls. 
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U.S. National Phase of PCT/SE00/01767 
REMARKS 

Applicants have amended the claims in order to reduce the initial filing fee by 
deleting the multiple dependent claims from the application. Some of this subject 
matter has been reintroduced as dependent claims 10-20. Applicants retain the right 
to reintroduce any subject matter canceled by the present Amendment at any time 
during the prosecution of this application or any further application claiming benefit of 
this application. 

Applicants have amended the application to substitute the originally filed page 
8 with the amended claim set page 1 attached to the International Preliminary Examiner 
Report (Annexes) and included in the application as filed herewith. Also, an Abstract 
of the Disclosure has been added to the application. 

Applicants are submitting herewith a copy of the International Search Report 
which issued on International Application No. PCT/SE00/01767, of which the present 
application is the U.S. national phase which was published in English. All of the 
publications cited in the International Search Report are listed on the attached Form 
PTO-1449. It is Applicants 1 understanding that, under the procedures of the PCT, 
copies of the cited publications will have been supplied to the U.S. Patent Office by the 
International Bureau. However, the Examiner is invited to contact the undersigned 
attorney if additional copies are necessary or would facilitate examination of the present 
application. 

Otherwise, the Examiner is respectfully requested to return an initialed and dated 
copy of the attached Form PTO-1449 to confirm that all publications listed thereon have 
been considered and made officially of record in the file of this application. 

Applicants understand that, under the procedures of the PCT, a copy of the 
priority document (SE 9903336-7, filed 17 September 1999) will have been supplied to 
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the U.S. Patent Office pursuant to Rule 17 of the PCT Regulations. It is therefore 
respectfully requested that the first Official Action in the present application contain an 
indication that the appropriate priority document is in the file of this application. 

In view of the above amendments, an early action on the application is now in 
order and is most respectfully requested. 



625 Slaters Lane - 4th Floor 
Alexandria, Virginia 22314 
Phone: (703) 683-0500 
Facsimile: (703) 683-1080 

REF:kdd 



Respectfully submitted, 
BACON & THOMAS, PLLC 




Richard E. Fichter 
Registration No. 26,382 
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U.S. National Phase of PCT/SE00/01767 
Marked-Up Version Showing Changes Made 

IN THE CLAIMS : 

Please replace claims 3 and 5-6 with the following amended claims. 

3(Amended). Transgenic oilseed plant cell according to claim 1 [or 2], wherein 
the promoter is a napin promoter, the peptide with enzyme activity necessary for keto 
group containing xanthophyll production and esterification is selected from the group 
consisting of peptides with, 1-D-deoxyxylulose 5-phosphate synthase, isopentenyl 
pyrophosphate:dimethylallyl pyrophosphate isomerase, geranylgeranyl pyrophosphate 
synthase, phytoene synthase, phytoene desaturase, zeta-carotene desaturase, 
lycopene beta-cyclase, p-carotene hydroxylase, and acyl transferase activity. 

5(Amended). Transgenic oilseed plant cell according to [any one of claims 1-5] 
claim 1 . wherein the oilseed plant is selected from the group consisting of rape, 
sunflower, soybean and mustard. 

6(Amended). Transgenic oilseed plant cell according to [any one of claims 1 - 5] 
claim 1 . wherein the cell expresses xanthophyll. 
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29295/BN Abstract 

A DNA construct comprising in the 5' to 3' direction of transcription operably 
linked a promoter region directing transcription to the seed of an oilseed plant, a 
5 nucleotide sequence coding for at least one peptide with enzyme activity necessary for 
keto group containing xanthophyll production and esterification in an oilseed plant and 
a transcriptional termination region is disclosed. The DNA construct may additionally 
comprise a nucleotide sequence coding for a transit peptide directing the translated 
fusion polypeptide to the chloroplast of the oilseed plant. The peptide with enzyme 
10 activity is preferably a peptide with b-carotene C-4-oxygenase activity, e.g. from the 
alga Haematococcus pluvialis. 

Comprised by the invention are also a transgenic oilseed plant cell, e.g. of rape, 
sunflower, soybean or mustard origin; transgenic oilseed plant-produced xanthophyll; 
transgenic oilseed plant-produced canthaxanthin; transgenic oilseed plant-produced 
15 astaxanthin; and transgenic oilseed plant-produced astaxanthin esters. 
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DNA construct and its use. 
The present invention relates to a new DNA construct for transformation into oilseed 
plants. The DNA construct comprises nucleotide sequences encoding peptides with enzyme 
activities necessary for the high-level production and esterification of keto group-containing 
xanthophylls in oilseed plants. 

Background of the invention 

Carotenoids are produced de novo by plants, fungi, algae and some bacteria. A 
number of biosynthetic steps are needed for the biological production of the carotenoids. 
There are two chemically different groups of carotenoids, namely carotenes containing only 
carbon and hydrogen molecules and xanthophylls containing oxygen in the molecule in 
addition to carbon and hydrogen. 

The xanthophylls, and particularly astaxanthin (33*-dihydroxy-P-p-carotene-4,4'- 
dione), are often colored pigments and are used as such or as anti-oxidants. 

Carotenes are biological precursors for the production of the oxygen-containing 
xanthophylls. There are two types of enzymes responsible for the introduction of hydroxy 
groups and keto groups into the carotenes, namely hydroxylases and ketolases, respectively. 

The keto group-containing xanthophyll astaxanthin, which has keto and hydroxy 
groups, is biosynthetically produced from beta-carotene. 

Large-scale production of xanthophylles from natural sources is at present performed 
by AstaCarotene AB, Gustavsberg, Sweden, by cultivation of the alga Haematococcus 
pluvialis for the production of astaxanthin in esterified form. 

It would be desirable to be able to produce keto group-containing xanthophylls 
particularly astaxanthin, in oilseed plants. Oilseed plants have naturally p-carotene 
hydroxylases but lack P-carotene C-4-oxygenase enzymes or ketolases. 

Description of the invention 
The present invention provides DNA constructs enabling and promoting 
production of keto group containing xanthophylls, especially astaxanthin, in oilseed plants, 
such as rape, sunflower, soybean and mustard. The DNA construct is transformed into the 
oilseed plant cell for expression of a protein or fused protein which has an enzyme activity 
enabling keto group insertion into a carotene or hydroxy carotene for the biosynthetic 
production of a keto group containing xanthophyll, such as cantaxanthin (P,P-carotene-4,4 > - 
dione) and/or astaxanthin. Use is thus made of the biosynthetic pathway of the oilseed plant to 
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produce carotenoids. The naturally occurring synthesis of carotenoids involves a number of 
enzymes, namely 1-D-deoxyxylulose 5-phosphate synthase, isopentenyl 
pyrophosphate.dimethylallyl pyrophosphate isomerase, geranylgeranyl pyrophosphate 
synthase, phytoene synthase, phytoene desaturase, zeta-carotene desaturase, lycopene beta- 
cyclase, p-carotene hydroxylase, and P-carotene C-4-oxygenase. Genes coding for peptides 
having these enzymatic activities may be inserted into the DNA construct of the invention, 
one or several per construct, to promote high-level production in the transgenic oilseed plant. 
In case only one enzyme coding gene is inserted per plant, two or more plants may be 
sexually interbred to produce plants containing all the desired enzyme activities. 

Thus, the present invention is directed to a DNA construct comprising in the 5' to 3' 
direction of transcription operably linked a promoter region directing transcription to the seed 
of an oilseed plant, a nucleotide sequence coding for at least one peptide with enzyme activity 
necessary for keto group containing xanthophyil production and esterification in an oilseed 
plant and a transcriptional termination region. 

In a preferred embodiment of the invention the DNA construct additionally 
comprises between the promoter region and the nucleotide sequence coding for at least one 
peptide with enzyme activity a nucleotide sequence coding for a transit peptide directing the 
translated fusion polypeptide to the chloroplast of the oilseed plant. 

The DNA construct is preferably such that the promoter is a napin promoter, the 
peptide with enzyme activity necessary for keto group containing xanthophyil production is 
selected from the group consisting of peptides with 1-D-deoxyxylulose 5-phosphate synthase, 
isopentenyl pyrophosphate.dimethylallyl pyrophosphate isomerase, geranylgeranyl 
pyrophosphate synthase, phytoene synthase, phytoene desaturase, zeta-carotene desaturase, 
lycopene beta-cyclase, p-carotene hydroxylase, and P-carotene C-4-oxygenase activity. To 
promote esterification of astaxanthin a nucleotide sequence coding for a peptide with acyl 
transferase activity may be included in the group. 

In a preferred embodiment of the DNA construct according to the invention the 
nucleotide sequence coding for a peptide with enzyme activity is a nucleotide sequence 
coding for a N-terminally truncated P-carotene C-4-oxygenase gene from the alga 
Haematococcus pluvialis. 

An example of the DNA construct of the invention is presented in the sequence 
listing as SEQ ID NO: 1 and in Fig. 1 . 
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The present invention is also directed to a transgenic oilseed plant cell 
, comprising the DNA construct of the invention, and preferably the oilseed plant is selected 
from the group consisting of rape, sunflower, soybean and mustard. 

The invention is additionally directed to transgenic oilseed plant-produced 
xanthophyll, e.g. canthaxanthin and astaxanthin. 

A preferred aspect of the invention is directed to transgenic oilseed plant- 
produced astaxanthin esters. 

The present invention will now be illustrated with reference to the DNA 
construct disclosed in the sequence listing and in Fig.l, and the following description of 
embodiments. However, the invention is not limited to these exemplifications. 

Short description of the drawings 
Fig.l illustrates the nucleotide sequence of the DNA construct comprising the napin promoter, 
the chloroplast localization signal, the N-terminally truncated P-carotene C-4-oxygenase gene 
and the termination sequence, and the deduced amino acid sequences of the transit peptide 
and the p-carotene C-4-oxygenase. 

Description of embodiments 
The invention is illustrated by production of astaxanthin in the seed of oilseed 
rape. The astaxanthin produced in the seed of the transgenic plant is extracted as part of the 
extracted oil. By use of conventionally used protocols for Agrobacterium tumefaciens 
mediated transformation such as described by (Hoekema et al.1983, An et al. 1986, Fry et al. 
1987, DeBlock et al. 1988, Radke et aL1988, or Moloney et al. 1989) transgenic plants are 
produced having a chimeric DNA construct that is genetically inherited and is able to produce 
astaxanthin. The nucleotide sequence of the chimeric DNA construct consist of four parts of 
different genetic origin namely: (1) a promoter, (2) a localization signal, (3) a p-carotene C-4- 
oxygenase coding region and (4) a termination sequence. 

The napin promoter directs transcription to the seed of oilseed rape (Stalberg et 
al 1996). This promoter was coupled to a localization signal similar but not identical to a 
transit peptide (TP) of Rbcsla (Krebbers, 1988) that directs the translated product of a fused 
gene to the chloroplast. The promoter and the TP sequence were ligated to a part of the coding 
sequence of a ketolase gene BCK (Kajiwara et al. 1995). This enzyme oxygenates P-carotene 
to canthaxanthin, (Fraser et al. 1997). The chimeric DNA construct was then coupled to a 
suitable termination sequence, e.g. that of the Agrobacterium tumefaciens nopaline synthase 
gene (the nos 3 ' end)(Bevan et al. 1 983), as illustrated in Fig. 1 . 



WO 01/20011 PCT/SE00/01767 

4 

Cellular storage of Astaxantin 

The storage of large amounts of free astaxanthin in plants will be difficult due to 
toxic effects of the molecule as it intercalates in the plant membranes. An effective 
esterification of astaxanthin to fatty acids enables storage of the esterified molecules in 
triacylglycerol containing oleosomes. Thus, an acyl transferase can be claimed to be of 
fundamental importance for the process, as is proteins that can mediate transport of different 
forms of astaxanthin from the chloroplast to the vesicles. 

Sequences and oligonucleotide s used in the constnir.Hon of the DNA construct 

1. Napin promoter (GeneBank ACCESSION No. J02798) 

This promoter sequence, a 1 145 base pair fragment including the 5' leader 
sequence has a unique Hindlll site at the 5' end. The 3' end was synthesized with an 
additionally 6 nucleotide BamHI site. 

2. Transit peptide similar to RBCSla (GeneBank ACCESSION No. XI 361 1, X14565) 

The transit peptide (TP) was amplified by PCR from -28 to the end of the transit 
cleavage aa=54/55 site of the Rbcsla gene. The 5' end was synthesized with a BamHI site 
and similarly the 3' sequence was synthesized with a Xbal site. The two following 
oligonucleotides were used for the PCR amplification. 

BamHI 

5 ' primer: TP 1 5 ' AGAC GGATCC TCAGTCACACAAAGAGTA 3 ' 

Sad Xbal 

3' primer: TP2 5 'GTTC GAGCTC TCTAGA CATGCAGTTAACGC 3 ' 

3. BCK (fi-carotene C-4 oxygenase) (Genebank ACCESSION No. D45881) 

The BCK fragment was amplified by PCR including a 5' Xbal site and was 
ligated to the TP already described. The 5' primer (BCK1) used for PCR, is homologous to 
the BCK sequence from nucleotide 264 and the 3' oligonucleotide (Ax40) ends with a stop 
codon and was synthesized with a Sad restriction site for cloning. The synthesized fragment 
was fused to the TP as shown in Fig 1. 
Oligonucleotides used for PCR: 

Xbal 

5 ' primer: BCK1 5 ACAG TCTAGA ATGCCATCCGAGTCGTCA 3 ' 

Sad 

3 'primer: AX40 5 'CACCGAGCTCCATGACACTCTTGTGCAGA 3 ' 
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Description of SEQ ID NO:l and SEQ ID NO:2 

The sequences shown i Fig.l are the same as the two sequences which are 
shown in the sequence listing. 

The SEQ ID NO:l is a nucleotide sequence composed of the following features: 

Nucleotide No. 
Cloning site Hindlll 1 -6 

Napin Promoter 1 - 1 1 45 

Cloning site BamHI 1 1 46- 1151 

Transit peptide leader 1 1 52-1 1 78 

Transit peptide coding 1 1 79- 1 347 

Cloning site Xbal 1 348- 1 353 

P-carotene C-4-oxygenase 1354-2217 
P-carotene C-4-oxygense 3' untranslated 2218-2266 
Cloning site SacI 2267-2272 
Nopaline synthetase termination 2273-2536 
Cloning site EcoRI 2538-2543 



The SEQ ID NO: 2 is a deduced amino acid sequence of the fusion protein of 
the transit peptide and the peptide with P-carotene C-4-oxygenase activity. 
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Claims 

1. Transgenic oilseed plant cell containing a DNA construct comprising in the 5' 
to 3 ' direction of transcription operably linked a promoter region directing transcription to the 
seed of the oilseed plant, a nucleotide sequence coding for a transit peptide directing the 
translated fusion polypeptide to the chloroplast of the oilseed plant, a S'-truncated beta- 
carotene C-4-oxygenase gene from the alga Haematococcus pluvialis and a transcriptional 
termination region. 

2. Transgenic oilseed plant cell according to claim 1, wherein the cell 
additionally contains at least one DNA construct selected from DNA constructs comprising in 
the 5' to 3' direction of transcription operably linked a promoter region directing transcription 
to the seed of the oilseed plant, a nucleotide sequence coding for a transit peptide directing the 
translated fusion polypeptide to the chloroplast of the oilseed plant, a nucleotide sequence 
coding for at least one peptide with enzyme activity necessary for keto group containing 
xanthophyll production and esterification in the oilseed plant and a transcriptional termination 
region. 

3. Transgenic oilseed plant cell according to claim 1 or 2, wherein the promoter 
is a napin promoter, the peptide with enzyme activity necessary for keto group containing 
xanthophyll production and esterification is selected from the group consisting of peptides 
with, 1-D-deoxyxylulose 5-phosphate synthase, isopentenyl pyrophosphaterdimethylallyl 
pyrophosphate isomerase, geranylgeranyl pyrophosphate synthase, phytoene synthase, 
phytoene desaturase, zeta-carotene desaturase, lycopene beta-cyclase, P-carotene hydroxylase, 
and acyl transferase activity. 

4. Transgenic oilseed plant cell according to claim 1, wherein the nucleotide 
sequence of the DNA construct is SEQ ID NO: 1 . 

5. Transgenic oilseed plant cell according to any one of claims 1-5, wherein the 
oilseed plant is selected from the group consisting of rape, sunflower, soybean and mustard. 

6. Transgenic oilseed plant cell according to any one of claims 1 - 5, wherein the 
cell expresses xanthophylls. 

7. Transgenic oilseed plant cell according to claim 6, wherein a xanthophyll is 
canthaxanthin. 

8. Transgenic oilseed plant cell according to claim 6, wherein a xanthophyll is 

astaxanthin. 

9. Transgenic oilseed plant cell according to claim 8, wherein the astaxanthin 
comprises astaxanthin esters. 
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Napin promoter 

AAGCTTTCTTCATCGGTGATTGATTCCTTTAAAGACTTATGTTTCTTATCTTGCTTCTGA 

GGCAAGTATTCAGTTACCAGTTACCACTTATATTCTGGACTTTCTGACTGCATCCTCATT 

TTTCCAACATTTTAAATTTCACTATTGGCTGAATGCTTCTTCTTTGAGGAAGAAACAATT 

CAGATGGCAGAAATGTATCAACCAATGCATATATACAAATGTACCTCTTGTTCTCAAAAC 

ATCTATCGGATGGTTCCATTTGCTTTGTCATCCAATTAGTGACTACTTTATATTATTCAC 

TCCTCTTTATTACTATTTTCATGCGAGGTTGCCATGTACATTATATTTGTAAGGATTGAC 

GCTTATTGAGCGTTTTTCTTCAATTTTCTTTATTTTAGACATGGGTATGAAATGTGTGTTA 

GAGTTGGGTTGAATGAGATATACGTTCAAGTGAAGTGGCATACCGTTCTCGAGTAAGGAT 

GACCTACCCATTCTTGAGACAAATGTTACATTTTAGTATCAGAGTAAAATGTGTACCTAT 

AACTCAAATTCGATTGACATGTATCCATTCAACATAAAATTAAACCAGCCTGCACCTGCA 

TCCACATTTCAAGTATTTTCAAACCGTTCGGCTCCTATCCACCGGGTGTAACAAGACGGA 

TTCCGAATTTGGAAGATTTTGACTCAAATTCCCAATTTATATTGACCGTGACTAAATCAA 

CTTTAACTTCTATAATTCTGATTAAGCTCCCAATTTATATTCCCAACGGCACTACCTCCA 



TATGAAGTTAAGTTTTTACCTTGTTTTTAAAAAGAATCGTTCATAAGATGCCATGCCAGA 

ACATTAGCTACACGTTACACATAGCATGCAGCCGCGGAGAATTGTTTTTCTTCGCCACTT 

GTCACTCCCTTCAAACACCTAAGAGCTTCTCTCTCACAGCACACACATACAATCACATGC 

GTGCATGCATTATTACACGTGATCGCCATGCAAATCTCCTTTATAGCCTATAAATTAACT 

CATCCGCTTCACTCTTTACTCAAACCAAAACTCATCAATACAAACAAGATTAAAAACATA 

End -2 8 untranslated leader TP start 

CACGAGGATCCTCAGTCACACAAAGAGTAAAGAAGAACAATGGCTTCCTCTATGCTCTCT 

MAS S • M Xi S 



TCCGCTACTATGGTTGCCTCTCCGGCTCAGGCCACTATGGTCGCTCCTTTCAACGGACTT 
SATMVAS PAQATMVAP FNGL 

AAGTCCTCCGCTGCCTTCCCAGCCACCCGCAAGGCTAACAACGACATTACTTCCATCACA 
KS SAAFPA TRKANNDI TSIT 



FIG.l 
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TP End C - 4 - Oxygenase 

AGCAACGGCGGACGCGTTAACTGCATGTCTAGAATGCCATCCGAGTCGTCAGACGCAGCT 
SNGGRVNCMSRMP SE S S DAA 

CGTCCTGCGCTAAAGCACGCCTACAAACCTCCAGCATCTGACGCCAAGGGCATCACGATG 
RPALKHAYKPPASDAKGITM 

GCGCTGACCATCATTGGCACCTGGACCGCAGTGTTTTTACACGCAATATTTCAAATCAGG 
A I* T I I GTWTAVFL HA I F Q I R , 

CTACCGACATCCATGGACCAGCTTCACTGGTTGCCTGTGTCCGAAGCCACAGCCCAGCTT 
LPTSMDQLHWLPV SEATAQL 

TTGGGCGGAAGCAGCAGCCTACTGCACATCGCTGCAGTCTTCATTGTACTTGAGTTCCTG 
LGGSSSLLHIAAVFIVIiEF L 

TACACTGGTCTATTCATCACCACACATGACGCAATGCATGGCACCATAGCTTTGAGGCAC 
YTGIj FITTHDAMH GT IALRH 

AGGCAGCTCAATGATCTCCTTGGCAACATCTGCATATCACTGTACGCCTGGTTTGACTAC 
RQLNDLLGNI CI S I* Y A W F D' Y 

AGCATGCTGCATCGCAAGCACTGGGAGCACCACAACCATACTGGCGAAGTGGGGAAAGAC 
SMLHRKHWEHHNH TG EVGKD 

CCTGACTTCCACAAGGGAAATCCCGGCCTTGTCCCCTGGTTCGCCAGCTTCATGTCCAGC 
PDFHKGNPGLVPWFASFMSS 

TACATGTCCCTGTGGCAGTTTGCCCGGCTGGCATGGTGGGCAGTGGTGATGCAAATGCTG 
j YMSLWQFARLAWWAVVMQML 

GGGGCGCCCATGGCAAATCTCCTAGTCTTCATGGCTGCAGCCCCAATCTTGTCAGCATTC 
4 f G A P M A N L L V F M A A A P I L. SAF 

CGCCTCTTCTACTTCGGCACTTACCTGCCACACAAGCCTGAGCCAGGCCCTGCAGCAGGC 
£ j JRLFYFGTYLPHKP EPGPAAG 

TCTCAGGTGATGGCCTGGTTCAGGGCCAAGACAAGTGAGGCATCTGATGTGATGAGTTTC 
(ty SQVMAWFRAKTS EASDVMSF 

CTGACATGCTACCACTTTGACCTGCACTGGGAGCACCACAGATGGCCCTTTGCCCCCTGG 
I | LTCYHFDLHWEHHRWPFAPW 

C-4 oxygenase Stop 
TGGCAGCTGCCCCACTGCCGCCGCCTGTCCGGGCGTGGCCTGGTGCCTGCCTTGGCATGA 

\Wqlphcrrlsgrglv pa|la* 



FIG.l (cont . ) 
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C-4 oxygenase untranslated region Nos term 

CCTGGTCCCTCCGCTGGTGACCCAGCGTCTGCACAAGAGTGTCATGGAGCTCGAATTTCC 

CCGATCGTTCAAACATTTGGCAATAAAGTTTCTTAAGATTGAATCCTGTTGCCGGTCTTG 

CGATGATTATCATATAATTTCTGTTGAATTACGTTAAGCATGTAATAATTAACATGTAAT 

GCATGACGTTATTTATGAGATGGGTTTTTATGATTAGAGTCCCGCAATTATACATTTAAT 

ACGCGATAGAAAACAAAATATAGCGCGCAAACTAGGATAAATTATCGCGCGCGGTGTCAT 
end 

CTATGTTACTAGATCGGGAATTC 



Fig. 1 (cont * ) 
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SEQUENCE LISTING 



<110> AstaCarotene AB 



<120> DNA construct and its use 

<130> 29295 -AstaCarotene 

<140> 
<141> 

<160> 2 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 2543 
<212> DNA 

<213> Artificial Sequence 
f <220> 

<223> Description of Artificial Sequence: napin promoter 

+ choroplast localization signal + beta-carotene C-4 oxygenase 
coding sequence + termination sequence 

<220> 

<221> promoter 
<222> (1) . . (1145) 

<220> 

<221> transit^ peptide 
<222> (1179) - . (1347) 

<220> 
<221> CDS 

<222> (1179) . . (2217) 

f <220> 

<221> terminator 
<222> (2273) . . (2536) 

<400> 1 



aagctttctt 


catcggtgat 


tgattccttt 


aaagacttat 


gtttcttatc 


ttgcttctga 


60 


ggcaagtatt 


cagttaccag 


ttaccactta 


tattctggac 


tttctgactg 


catcctcatt 


120 


tttccaacat 


tttaaatttc 


actattggct 


gaatgcttct 


tctttgagga 


agaaacaatt 


180 


cagatggcag 


aaatgtatca 


accaatgcat 


atatacaaat 


gtacctcttg 


ttctcaaaac 


240 


atctatcgga 


tggttccatt 


tgctttgtca 


tccaattagt 


gactacttta 


tattattcac 


300 


tcctctttat 


tactattttc 


atgcgaggtt 


gccatgtaca 


ttatatttgt 


aaggattgac 


360 


gctattgagc 


gtttttcttc 


aattttcttt 


attttagaca 


tgggtatgaa 


atgtgtgtta 


420 


gagttgggtt 


gaatgagata 


tacgttcaag 


tgaagtggca 


taccgttctc 


gagtaaggat 


480 


gacctaccca 


ttcttgagac 


aaatgttaca 


ttttagtatc 


agagtaaaat 


gtgtacctat 


540 
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aactcaaatt cgattgacat gtatccattc aacataaaat taaaccagcc tgcacctgca 600 

tccacatttc aagtattttc aaaccgttcg gctcctatcc accgggtgta acaagaegga 660 

ttccgaattt ggaagatttt gactcaaatt cccaatttat attgacegtg actaaatcaa 72 0 

ctttaacttc tataattctg attaagctcc caatttatat tcccaacggc actacctcca 780 

aaatttatag actctcatcc ccttttaaac caacttagta aacgtttttt tttttaattt 840 

tatgaagtta agtttttacc ttgtttttaa aaagaatcgt tcataagatg ccatgccaga 900 

acattagcta cacgttacac atagcatgea gecgeggaga attgtttttc ttcgccactt 960 

gtcactccct tcaaacacct aagagcttct ctctcacagc acacacatac aatcacatgc 1020 

gtgeatgeat tattacacgt gatcgecatg caaatctcct ttatagecta taaattaact 1080 

catccgcttc actctttact caaaccaaaa ctcatcaata caaacaagat taaaaacata 1140 

cacgaggatc ctcagtcaca caaagagtaa agaagaaca atg get tec tct atg 1194 

Met Ala Ser Ser Met 
1 5 

etc tct tec get act atg gtt gee tct ccg get cag gee act atg gtc 1242 
Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala Gin Ala Thr Met Val 
10 15 20 

get cct ttc aac gga ctt aag tec tec get gee ttc cca gee ace cgc 12 90 
Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala Phe Pro Ala Thr Arg 
25 30 35 

aag get aac aac gac att act tec ate aca age aac ggc gga cgc gtt 13 3 8 
Lys Ala Asn Asn Asp lie Thr Ser He Thr Ser Asn Gly Gly Arg Val 
40 45 50 

aac tgc atg tct aga atg cca tec gag teg tea gac gca get cgt cct 13 86 
Asn Cys Met Ser Arg Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro 
55 60 65 

gcg eta aag cac gee tac aaa cct cca gca tct gac gee aag ggc ate 1434 
Ala Leu Lys His Ala Tyr Lys Pro Pro Ala Ser Asp Ala Lys Gly He 
70 75 80 85 

acg atg gcg ctg ace ate att ggc ace tgg ace gca gtg ttt tta cac 14 82 
Thr Met Ala Leu Thr He He Gly Thr Trp Thr Ala Val Phe Leu His 
90 95 100 

gca ata ttt caa ate agg eta ccg aca tec atg gac cag ctt cac tgg 153 0 
Ala He Phe Gin He Arg Leu Pro Thr Ser Met Asp Gin Leu His Trp 
105 110 115 

ttg cct gtg tec gaa gee aca gee eag ctt ttg ggc gga age age age 1578 
Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly Ser Ser Ser 
120 125 130 

eta ctg cac ate get gca gtc ttc att gta ctt gag ttc ctg tac act 1626 
Leu Leu His He Ala Ala Val Phe He Val Leu Glu Phe Leu Tyr Thr 
135 140 145 
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ggt eta ttc ate ace aca cat gac gca atg cat ggc ace ata get ttg 1674 
Gly Leu Phe lie Thr Thr His Asp Ala Met His Gly Thr lie Ala Leu 
150 155 160 165 

agg cac agg cag etc aat gat etc ctt ggc aac ate tgc ata tea ctg 1722 
Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn lie Cys He Ser Leu 
170 175 180 

tac gee tgg ttt gac tac age atg ctg cat cgc aag cac tgg gag cac 1770 
Tyr Ala Trp Phe Asp Tyr Ser Met Leu His Arg Lys His Trp Glu His 
185 190 195 

cac aac cat act ggc gaa gtg ggg aaa gac cct gac ttc cac aag gga 1818 
His Asn His Thr Gly Glu Val Gly Lys Asp Pro Asp Phe His Lys Gly 
200 205 210 

aat ccc ggc ctt gtc ccc tgg ttc gee age ttc atg tec age tac atg 1866 
Asn Pro Gly Leu Val Pro Trp Phe Ala Ser Phe Met Ser Ser Tyr Met 
215 220 225 

tec ctg tgg cag ttt gee egg ctg gca tgg tgg gca gtg gtg atg caa 1914 
Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp Ala Val Val Met Gin 
230 235 240 245 

atg ctg ggg gcg ccc atg gca aat etc eta gtc ttc atg get gca gee 1962 
Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val Phe Met Ala Ala Ala 
250 255 260 

cca ate ttg tea gca ttc cgc etc ttc tac ttc ggc act tac ctg cca 2010 
Pro He Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Leu Pro 
265 270 275 

cac aag cct gag cca ggc cct gca gca ggc tct cag gtg atg gec tgg 2 05 8 
His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser Gin Val Met Ala Trp 
280 285 290 

ttc agg gee aag aca agt gag gca tct gat gtg atg agt ttc ctg aca 2106 
Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr 
295 300 305 

tgc tac cac ttt gac ctg cac tgg gag cac cac aga tgg ccc ttt gee 2154 
Cys Tyr His Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala 
310 315 320 325 

ccc tgg tgg cag ctg ccc cac tgc cgc cgc ctg tec ggg cgt ggc ctg 22 02 
Pro Trp Trp Gin Leu Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu 
330 335 340 

gtg cct gec ttg gca tgacctggtc cctccgctgg tgacccagcg tetgeacaag 2257 
Val Pro Ala Leu Ala 
345 

agtgtcatgg agctcgaatt tccccgatcg ttcaaacatt tggcaataaa gtttcttaag 2317 

attgaatcct gttgccggtc ttgegatgat tatcatataa tttctgttga attacgttaa 2377 

gcatgtaata attaacatgt aatgeatgae gttatttatg agatgggttt ttatgattag 2437 

agtcccgcaa ttatacattt aatacgegat agaaaacaaa atatagegeg caaactagga 2497 

taaattatcg cgcgcggtgt catctatgtt actagategg gaattc 2543 
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<210> 2 
<211> 346 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: deduced fusion protein of 

transit peptide + peptide with beta- carotene C-4 oxygenase activity 

<400> 2 

Met Ala Ser Ser Met Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala 

15 10 is 

Gin Ala Thr Met Val Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala 
20 25 30 

Phe Pro Ala Thr Arg Lys Ala Asn Asn Asp lie Thr Ser lie Thr Ser 
35 40 45 

Asn Gly Gly Arg Val Asn Cys Met Ser Arg Met Pro Ser Glu Ser Ser 
50 55 ^ 60 

Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys Pro Pro Ala Ser 
65 70 75 80 

Asp Ala Lys Gly lie Thr Met Ala Leu Thr lie lie Gly Thr Trp Thr 
85 90 95 

Ala Val Phe Leu His Ala lie Phe Gin lie Arg Leu Pro Thr Ser Met 
100 105 110 

Asp Gin Leu His Trp Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu 
115 120 125 

Gly Gly Ser Ser Ser Leu Leu His lie Ala Ala Val Phe lie Val Leu 
130 135 140 

Glu Phe Leu Tyr Thr Gly Leu Phe lie Thr Thr His Asp Ala Met His 
145 150 155 160 

Gly Thr lie Ala Leu Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn 
165 170 175 

lie Cys lie Ser Leu Tyr Ala Trp Phe Asp Tyr Ser Met Leu His Arg 
180 185 190 

Lys His Trp Glu His His Asn His Thr Gly Glu Val Gly Lys Asp Pro 
195 200 205 

Asp Phe His Lys Gly Asn Pro Gly Leu Val Pro Trp Phe Ala Ser Phe 
210 215 220 

Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp 
225 230 235 240 

Ala Val Val Met Gin Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val 
245 250 255 
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Phe Met 



Ala Ala Ala Pro He Leu Ser Ala 
260 265 



Phe Arg Leu Phe Tyr Phe 
270 



Gly Thr Tyr Leu Pro His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser 
275 280 285 

Gin Val Met Ala Trp Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp Val 
290 295 300 

Met Ser Phe Leu Thr Cys Tyr His Phe Asp Leu His Trp Glu His His 
305 310 315 320 

Arg Trp Pro Phe Ala Pro Trp Trp Gin Leu Pro His Cys Arg Arg Leu 
325 330 335 

Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 



340 



345 



SEQtTETKrCE LISTING 

<1 10> AstaCarotene AB 



<120> DNA construct and its use 

<130> 29295-AstaCarotene 

<140> 
<141> 

<160> 2 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 2543 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: napin promoter 

+ choroplast localization signal + beta-carotene C-4 oxygenase 
coding sequence + termination sequence 

<220> 

<221> promoter 
<222> (1) . . (1145) 

<220> 

<221> transit_peptide 
<222> (1179) . . (1347) 

<220> 
<221> CDS 

<222> (1179) . . (2217) 
<220> 

<221> terminator 
<222> (2273) . . (2536) 



<400> 1 
aagctttctt 


catcggtgat 


tgattccttt 


aaagacttat 


gtttcttatc 


ttgcttctga 


60 


ggcaagtatt 


cagttaccag 


ttaccactta 


tat tctggac 


tttctgactg 


catcctcatt 


120 


tttccaacat 


tttaaatttc 


actattggct 


gaatgcttct 


tctttgagga 


agaaacaatt 


180 


cagatggcag 


aaatgtatca 


accaatgcat 


atatacaaat 


gtacctcttg 


ttctcaaaac 


240 


atctatcgga 


tggttccatt 


tgctttgtca 


tccaattagt 


gactacttta 


tattattcac 


300 


tcctctttat 


tactattttc 


atgcgaggtt 


gccatgtaca 


ttatatttgt 


aaggattgac 


360 


gctattgagc 


gtttttcttc 


aattttcttt 


attttagaca 


tgggtatgaa 


atgtgtgtta 


420 


gagttgggtt 


gaatgagata 


tacgttcaag 


tgaagtggca 


taccgttctc 


gagtaaggat 


480 


gacctaccca 


ttcttgagac 


aaatgttaca 


ttttagtatc 


agagtaaaat 


gtgtacctat 


540 


aactcaaatt 


cgattgacat 


gtatccattc 


aacataaaat 


taaaccagcc 


tgcacctgca 


600 


tccacatttc 


aagtattttc 


aaaccgttcg 


gctcctatcc 


accgggtgta 


acaagacgga 


660 
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ttccgaattt 


ggaagatttt 


gactcaaatt 


cccaatttat 


attgacegtg 


actaaatcaa 


720 


ctttaacttc 


tataattctg 


attaagctcc 


caatttatat 


tcccaacggc 


actacctcca 


780 


aaatttatag 


actctcatcc 


ccttttaaac 


caacttagta 


aacgtttttt 


tttttaattt 


840 


tatgaagtta 


agtttttacc 


ttgtttttaa 


aaagaatcgt 


tcataagatg 


ccatgccaga 


900 


acattagcta 


cacgttacac 


atagcatgca 


gccgcggaga 


attgtttttc 


ttcgccactt 


960 


gtcactccct 


tcaaacacct 


aagagcttct 


ctctcacagc 


acacacatac 


aatcacatgc 


1020 


gtgcatgcat 


tattacacgt 


gatcgccatg 


caaatctcct 


ttatagecta 


taaattaact 


1080 


catccgcttc 


actctttact 


caaaccaaaa 


ctcatcaata 


caaacaagat 


taaaaacata 


1140 


cacgaggatc 


ctcagtcaca 


caaagagtaa 


agaagaaca atg get tec 
Met Ala Ser 


tct atg 
Ser Met 


1194 



etc tct tec get act atg gtt gee tct ccg get cag gec act atg gtc 1242 

Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala Gin Ala Thr Met Val 
10 " 15 20 

get cct ttc aac gga ctt aag tec tec get gee ttc cca gee ace cgc 1290 

Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala Phe Pro Ala Thr Arg 
25 30 35 

aag get aac aac gac att act tec ate aca age aac ggc gga cgc gtt " 1338 

Lys Ala Asn Asn Asp lie Thr Ser lie Thr Ser Asn Gly Gly Arg Val 
40 45 50 

aac tgc atg tct aga atg cca tec gag teg tea gac gca get cgt cct 1386 

Asn Cys Met Ser Arg Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro 
55 60 65 

gcg eta aag cac gec tac aaa cct cca gca tct gac gee aag ggc ate 1434 

Ala Leu Lys His Ala Tyr Lys Pro Pro Ala Ser Asp Ala Lys Gly lie 

70 75 80 85 

acg atg gcg ctg ace ate att ggc ace tgg ace gca gtg ttt tta cac 1482 

Thr Met Ala Leu Thr He He Gly Thr Trp Thr Ala Val Phe Leu His 
90 95 100 

gca ata ttt caa ate agg eta ccg aca tec atg gac cag ctt cac tgg 1530 

Ala lie Phe Gin He Arg Leu Pro Thr Ser Met Asp Gin Leu His Trp 
105 110 115 

ttg cct gtg tec gaa gee aca gee cag ctt ttg ggc gga age age age 1578 

Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly Ser Ser Ser 
120 125 130 

eta ctg cac ate get gca gtc ttc att gta ctt gag ttc ctg tac act 1626 

Leu Leu His lie Ala Ala Val Phe He Val Leu Glu Phe Leu Tyr Thr 
135 140 145 
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ggt eta ttc ate acc aca cat gac gca atg cat ggc acc ata get ttg 1674 
Gly Leu Phe lie Thr Thr His Asp Ala Met His Gly Thr lie Ala Leu 
150 155 160 165 

agg cac agg cag etc aat gat etc ctt ggc aac ate tgc ata tea ctg 1722 
Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn lie Cys lie Ser Leu 
170 175 " 180 

tac gee tgg ttt gac tac age atg ctg cat cgc aag cac tgg gag cac 1770 
Tyr Ala Trp Phe Asp Tyr Ser Met Leu His Arg Lys His Trp Glu His 
185 ' ' 190 * 195 

cac aac cat act ggc gaa gtg ggg aaa gac cct gac ttc cac aag gga 1818 
His Asn His Thr Gly Glu Val Gly Lys Asp Pro Asp Phe His Lys Gly 
200 205 210 

aat ccc ggc ctt gtc ccc tgg ttc gee age ttc atg tec age tac atg 1866 
Asn Pro Gly Leu Val Pro Trp Phe Ala Ser Phe Met Ser Ser Tyr Met 
215 ~ 220 225 

tec ctg tgg cag ttt gee egg ctg gca tgg tgg gca gtg gtg atg caa 1914 
Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp Ala Val Val Met Gin 
230 235 " 240 245 

atg ctg ggg gcg ccc atg gca aat etc eta gtc ttc atg get gca gee 1962 
Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val Phe Met Ala Ala Ala 
250 255 260 

cca ate ttg tea gca ttc cgc etc ttc tac ttc ggc act tac ctg cca 2010 
Pro lie Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Leu Pro 
265 270 275 

cac aag cct gag cca ggc cct gca gca ggc tct cag gtg atg gee tgg 2058 
His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser Gin Val Met Ala Trp 
280 285 290 

ttc agg gee aag aca agt gag gca tct gat gtg atg agt ttc ctg aca 2106 
Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr 
295 300 305 

tgc tac cac ttt gac ctg cac tgg gag cac cac aga tgg ccc ttt gee 2154 
Cys Tyr His Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala 
310 ^ 315 320 325 

ccc tgg tgg cag ctg ccc cac tgc cgc cgc ctg tec ggg cgt ggc ctg 2202 
Pro Trp Trp Gin Leu Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu 
330 ' 335 340 

gtg cct gec ttg gca tgacctggtc cctccgctgg tgacccagcg tetgeacaag 2257 
Val Pro Ala Leu Ala 
345 

agtgtcatgg agctcgaatt tccccgatcg ttcaaacatt tggcaataaa gtttcttaag 2317 
attgaatcct gttgccggtc ttgegatgat tatcatataa tttctgttga attacgttaa 2377 
gcatgtaata attaacatgt aatgeatgae gttatttatg agatgggttt ttatgattag 2437 
agtcccgcaa ttatacattt aatacgegat agaaaacaaa atatagegeg caaactagga 24 97 
taaattatcg cgcgcggtgt catctatgtt actagategg gaattc 2543 



<210> 2 
<211> 346 
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<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: deduced fusion protein of 

transit peptide + peptide with beta-carotene C-4 oxygenase activity 

<400> 2 

Met Ala Ser Ser Met Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala 
1 5 10 15 

Gin Ala Thr Met Val Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala 
20 25 30 

Phe Pro Ala Thr Arg Lys Ala Asn Asn Asp lie Thr Ser He Thr Ser 
35 40 45 

Asn Gly Gly Arg Val Asn Cys Met Ser Arg Met Pro Ser Glu Ser Ser 
50 55 60 

Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys Pro Pro Ala Ser 
65 70 75 80 

Asp Ala Lys Gly He Thr Met Ala Leu Thr He He Gly Thr Trp Thr 
85 90 95 

Ala Val Phe Leu His Ala He Phe Gin He Arg Leu Pro Thr Ser Met 
100 105 110 

Asp Gin Leu His Trp Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu 
115 120 125 

Gly Gly Ser Ser Ser Leu Leu His He Ala Ala Val Phe He Val Leu 
130 135 140 

Glu Phe Leu Tyr Thr Gly Leu Phe He Thr Thr His Asp Ala Met His 
145 150 155 160 

Gly Thr He Ala Leu Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn 
165 170 175 

He Cys He Ser Leu Tyr Ala Trp Phe Asp Tyr Ser Met Leu His Arg 
180 185 * 190 

Lys His Trp Glu His His Asn His Thr Gly Glu Val Gly Lys Asp Pro 
195 200 205 

Asp Phe His Lys Gly Asn Pro Gly Leu Val Pro Trp Phe Ala Ser Phe 
210 215 220 

Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp 
225 " 230 235 240 

Ala Val Val Met Gin Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val 
245 250 255 
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Phe Met Ala Ala Ala Pro lie Leu Ser Ala Phe Arg Leu Phe Tyr Phe 
260 265 270 

Gly Thr Tyr Leu Pro His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser 
275 280 285 

Gin Val Met Ala Trp Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp Val 
290 295 300 

Met Ser Phe Leu Thr Cys Tyr His Phe Asp Leu His Trp Glu His His 
305 310 315 320 

Arg Trp Pro Phe Ala 'Pro Trp Trp Gin Leu Pro His Cys Arg Arg Leu 
325 330 335 



Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 
340 345 



